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\  Abstract 

V 

This  paper  contains  an  exposition  of  the  by  now  rather  complete  central  limit 
theory  for  discrete  parameter  martingales  providing  new  and  efficient  proofs.  The 
basic  idea  is  to  start  by  proving  a  central  limit  theorem  under  quite  restrictive 
conditions  (that  the  summands  tend  uniformly  to  zero  and  that  the  sums  of  squares 
converge  uniformly)  and  then  to  obtain  the  most  general  results  by  random  change 
of  time  and  truncation.  The  emphasis  is  on  the  sums  of  squares  (or  squared  varia¬ 
tion  process),  and  Burkholder's  square  function  inequality  plays  a  crucial  role  in 
the  development.  In  particular,  this  approach  leads  to  a  very  short  and  direct 
proof  of  tightness.  In  the  proofs  we  make  much  use  of  a  result ^Lemma  2.5)  which 
is  believed  to  be  new  and  which  binds  together  convergence  to  zero  of  sums  and  of 
sums  of  conditional  expectations.  In  the  final  section,  the  results  are  extended 
to  several  dimensions,  to  mixing  convergence,  and  to  convergence  to  mixtures  of 
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0.  Introduction 


The  main  thesis  of  the  present  paper  is  to  view  the  martingale  central  limit 
theorem  as  basically  concerning  summands  which  tend  uniformly  to  zero,  and  with 
squared  variation  (sum  of  squares)  converging  uniformly,  and  then  to  reduce 
the  most  general  situation  to  this  case  by  (random)  change  of  timescale  and  by 
truncation.  We  think  that  this  both  appeals  to  the  intuition  and  leads  to  quite 
efficient  proofs.  The  purpose  of  the  paper  thus  is  to  give  a  selfcontained  expo¬ 
sition  of  the  basic  martingale  central  limit  theory,  using  this  point  of  view, 
providing  as  simple  and  efficient  proofs  as  possible.  A  second  assertion  we 
would  like  to  make  is  the  usefulness  of  stochastic  processes  point  of  view: 
that  it  is  the  functional  limit  results  which  are  important,  and  not  only  their 
one-dimensional  versions.  There  is,  of  course,  a  cost  associated  with  this:  one 
has  to  learn  at  least  some  elements  on  convergence  of  distributions  on  function 
spaces,  but  the  reward  then  is  both  better  understanding  and  easier  and  shorter 
proofs.  One  further  feature  of  our  development  below  is  an  emphasis  on  the 
squared  variation  process  and  a  systematic  use  of  Burkholder's  square  function 
inequality.  In  particular  this  makes  possible  a  very  easy  proof  of  tightness, 
which  in  other  approaches  often  requires  the  main  effort. 

The  central  limit  theorem  for  discrete  parameter  martingales  represents  one 
important  stage  in  the  development  of  central  limit  theory  and  has  in  the  last 
few  years  reached  what  seoms  to  be  essentially  its  final  form  and  has  also  proved 
its  value  in  many  applications  to  statistics  and  applied  probability.  The  theory 
has  also  been  recast  into  the  language  of  "the  general  theory  of  processes"  of 
the  Strasbourg  school  and  been  extended  to  the  continuous  parameter  case  by  the 
work  of  Rebolledo  [15,16],  Lipster  and  Shirayev  [13,14]  and  others.  This  has  led 
to  e  very  satisfying  formulation  of  the  results  and  a  rather  complete  extension 


of  the  theory.  Nevertheless  it  may  perhaps  also  be  said  that  the  essential  dif¬ 
ficulties  are  present  already  in  the  discrete  case  and  that  the  basic  continuous 
parameter  results  are  rather  easy  to  obtain  from  the  corresponding  results  for 
discrete  time,  as  shown  by  Helland  [9]. 

'•:ccept  for  the  multidimensional  result,  Theorem  4.3,  which  is  only  a  small 
step  away  from  the  one-dimensional  results  (although  it  seems  quite  useful  for 
applications),  none  of  the  theorems  of  this  paper  is  new.  However,  almost  all 
the  proofs  are  new  (the  main  idea  was  mentioned  briefly  by  the  present  author  in 
[19]  and  was  developed  in  some  detail  in  mimeographed  lecture  notes  from  the 
Department  of  Mathematical  Statistics,  Copenhagen  University) .  In  particular 
we  would  like  to  point  out  Lemma  2.5,  which  is  a  versatile  tool  and  which  is  new 
formulated  in  the  present  generality,  although  various  special  cases  have  been 
used  by  many  authors. 

A  related  exposition,  which  starts,  however,  by  assuming  known  a  basic  cen¬ 
tral  limit  theorem  for  bounded  martingale  differences  is  given  by  Helland  [9]. 

A  further  rather  different  exposition  which  uses  the  Skorokhod  embedding  is 
contained  in  the  recent  book  by  Hall  and  Heyde  [8],  Both  expositions  contain 
extensive  lists  of  references  and  accounts  of  the  development  of  the  subject,  to 
which  we  refer  the  reader.  Later  papers  of  interest  include  work  by  Klopotowski 
and  colleagues  [2,12],  the  articles  by  Lipster  and  Shiryaev  mentioned  above,  and 
a  series  of  papers  by  Jeganathan  [10,11].  The  approach  in  [11],  which  in  turn 
was  partly  inspired  by  Ros6n  [20],  is  somewhat  related  to  this  paper  and  was 
made  independently  of  it. 

The  plan  of  the  paper  is  as  follows.  Section  1  contains  some  notation,  and 
in  Section  2  the  results  we  need  from  other  areas  (functional  limit  theory  and 
martingales)  are  collected,  and  the  basic  truncation  lemmas  (Lemmas  2. 5-2. 7)  arc 
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obtained.  The  functional  central  limit  theorem  for  martingales  is  then  proved  in 
Section  3,  starting  from  scratch,  and  finally  Section  4  contains  a  somewhat  brief¬ 
er  discussion  of  one  direction  of  extension  of  the  results,  to  several  dimensions 
and  to  convergence  to  mixtures  of  normal  distributions. 


1.  Notation 


Throughout,  we  will  consider  doubly  indexed  arrays  {X^  ,  8^  j>l  .  n>l) 

where  the  X  ,'s  are  random  variables  or,  in  Section  4,  random  vectors,  and  for 

n.J 

each  n,  {8  ,  is  an  increasing  sequence  of  sigma -algebras,  i.e.,  8  .cR  . 

n,j  j  =  l  &  m  5  *  ’  n  j~  n , .i  + 1 

We  will  never  assume  that  the  X^  ..'s  are  obtained  by  linearly  renormalizing  a 
single  sequence  of  random  variables  since  that  is  not  the  case  in  many  of  the  ap¬ 
plications  but  will  sometimes  assume  that  the  sigma-algebras  are  nested,  i.e.  that 


8  i  -  8r,+i  i  >  for  n’j-]  ' 
n.J  n+1 ,  i 

which  seems  to  hold  in  most  cases  of  interest.  Possibly  by  going  over  to  a  pro¬ 
duct  space,  we  will  assume  that  all  X  .'s  and  8  .'s  are  defined  on  the  same 

n  >  3  ^  9  j 

probability  space  (12, 3, P)  .  The  array  is  said  to  be  adapted  if  X^  .  e  8n  .  for 

j>l,  n>l ,  and  it  is  a  martingale  difference  array  (m.d.a.)  if  in  addition 

{Xn  j,  j ;  j=l,2,...}  is  a  sequence  of  martingale  differences,  i.e.,  if 

E  |  X  .  |  <  °°  and  E(X  .  .  II  8  .)  =  0  for  j>l. 

A  stochastic  process  (x(t)},  defined  for  t  in  some  interval  I  is  a  time-scale 

if  it  is  nondecreasing,  has  left  limits  and  is  right  continuous.  A  sequence  {x^} 

of  time  scales  is  adapted  (to  (B  .})  if  for  each  n  and  tel,  x  (t)  is  a  stopping 

n ,  j  n 

time  with  respect  to  8  ,,  8  ..... 

K  n,l  n,2 

Let  S  (t)  =  y!*]  X  .be  the  partial  sum  in  the  n**1  row,  and  let 

nv  L)=\  n,j  * 
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{8(t) ;  tel}  he  a  standard  Brownian  motion.  The  problem  we  are  concerned  with  is 

convergence  in  distribution  of  time-scaled  row-sums  S^ot^  to  a  Brownian  motion, 

or  more  generally  a  time-scaled  Brownian  motion.  Here  of  course  S  oj  is  defined 
7  n  n 

by  S  ot  (t)  =  S  (t  (t)).  For  brevity  of  notation  we  will  usually  write  Sot  for 
3  n  n  n  n  '  1  1  n 

S^ot^,  and  E.(*)  for  E(*||  8^  ^)  (with  EQ=E)  when  taking  expectation  of  variables 

.  th 

in  the  n  row. 

A  partition  of  an  interval  [0,T]  is  a  finite  set  of  points,  0=t  <t^< . . ,<t^=T. 

For  a  given  partition,  we  will  write 

(1.1)  A(£)  =  t„  -  t„  ,  A  =  max  A (-£) 

*  \<l<k 


ASOt  (£)  =  Sot  (t.)  -  Sor  (t»  ) 

n  n  -c  n  -c-1 


VSOT  (t) 


sup 


k 

I 


n,  J 


Further,  indicator  functions  will  be  written  as  lg  or  l{  },  i.e.  lg C to)  is  one 

if  coeB  and  zero  if  0JeBC,  and  similarly  l{  }  is  one  if  the  event  in  curly  brackets 

occurs  and  zero  otherwise.  Finally,  sums  with  u; per  limits  which  are  not  integers 

cx  o  r  x 1 

are  defined  by  =  i-e*  summation  is  up  to  the  greatest  integer  which 

does  not  exceed  the  upper  limit. 


2.  Prerequisites:  Functional  Limit  Theorems.  Martingales.  Approximation 

For  easy  reference  and  because  one  purpose  of  this  paper  is  to  make  a  complete 
exposition  of  the  martingale  central  limit  theorem,  we  will  in  this  section  list 
the  results  on  convergence  in  distribution  and  on  martingales  which  are  needed 
for  the  proofs.  The  section  furthermore  contains  three  lemmas  which  arc  essential 
for  the  truncation  and  approximation  procedures  we  will  use. 


l.ot  X,  {x  }  he  random  variables  with  values  in  a  complete  separable  metric 
space  (S,p).  With  standard  terminology  and  notation,  converges  in  distribu¬ 
tion  to  X,  i  X  in  (S,p),  if  MX^)  +  h(X)  in  F  ,  for  all  functions  h:  S  -*•  F 
which  are  bounded  and  continuous  almost  surely  with  respect  to  the  distribution 
of  X,  and  X^  converges  in  probability  to  X,  X^  +  X  if  p(Xn>X)  ^  0.  If  X  has  a 
standard  normal  distribution  in  F^  we  also  write  convergence  in  distribution  as 
Xn  +  Nd(0,I')  and  for  d  =  1,  as  X  +  N(0,1).  Besides  F,  F^  ,  we  will  be  inter¬ 
ested  in  the  metric  spaces  D[0,T]  and  D[0,°°)  of  functions  on  [0,T]  and  on  [0,°°) 
which  have  left  limits  and  are  right  continuous,  with  metrics  described  in  [3,21] 
and  in  the  subset  [ 0 , 1 ]  of  nondecreasing  functions  in  D[0,1],  Convergence  in 
distribution  of  vectors  will  throughout  be  with  respect  to  the  relevant  product 
metric . 

The  first  result  is  a  criterion  for  convergence  in  D[0,°°) .  It  can  be  obtained 
as  an  easy  special  ease  of  the  results  of  [21,  Theorem  2.8]  and  [3,  Theorem  15.5 
amended  with  the  argument  of  Theorem  8.3]. 

Proposition  2.1.  With  the  notation  of  (1.1),  suppose  the  following  two  conditions 
hold: 

(i)  (tightness)  for  each  positive  T  and  e,  there  exists  a  function  f  such  that 
for  any  partition  of  [0,T] , 

k 

limsup  l  P(vSox  (£)>e)  <  f(A)  , 
n~°  1=1  n 

where  f(A)  -*■  0,  as  A  •+  0,  and 

(ii)  (finite  dimensional  convergence)  if  ik  }  j  is  a  sequence  of  integers  and 

X  a  continuous  stochastic  process  with  Sot^  ^  X  in  D[0,°°)  as  k  -*•  <*,  it  follows 

n 

that  X  has  the  same  finite-dimensional  distributions  as  B. 

Then  S°Tn  ^  >  as  n  -*■  «  . 

The  next  results,  on  "random  change  of  time"  and  approximation  are  phrased 
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in  terms  of  general  processes  Xn>Yn  sod  Y  in  DfO,00)  or  D[0,1]  and  timescales  {i^}. 

Proposition  2.2.  (i)  Suppose  Yn  >  Y  in  D[0,°°)  and  that  (t  (t);  te[0,l]}  are 

timescales  such  that 

(d.l)  Tn(t)  £  T(t)  ,  as  n  +  “>  , 

for  each  te[0,ll,  where  t  is  a  non-random  continuous  function.  Then  Y  ot  ^  Yot. 

n  n 

(ii)  Suppose  Y^  ^  Y  in  D[0,1]  and  that  {X^}  are  random  variables  in  n[0,l],  such 
that 

sup  |X  (t)  -Y  (t)  |  £  0  ,  as  n  -♦  00  . 

0<t<l  n  n 

Then  Xn  +  Y  in  0 [0, 1]  . 

Proof:  (i)  Since  pointwise  convergence  of  increasing  functions  to  a  continuous 

limit  implies  uniform  convergence,  (2.1)  implies  that  +  t  in  DQ (0,1]  .  and  the 
result  then  follows  from  [21,  Theorem  3.1]. 

(ii)  See  [3,  Theorem  4.1].  H 


From  martingale  theory  we  will  use  some  simple  consequences  of  the  optional 
sampling  theorem  (Proposition  2.3  below),  the  extension  of  Kolmogorov's  inequality 
to  martingales  (Proposition  2 . 4 ( i ) )  and  one  half  of  Burkholder's  square  function 
inequality  (Proposition  2.4(ii)).  All  of  this  belongs  to  the  standard  fare  from 
a  first  encounter  with  martingale  theory,  except  perhaps  the  square  function  in¬ 
equality.  An  elementary  (albeit  pedestrian)  proof  of  this  latter  result  is 
sketched  in  the  appendix  for  the  special  case  we  shall  need--an  elegant  proof 
for  the  general  case  is  given  in  [4] . 


Proposition  2.3.  Let  {X^.,8^.;  j >1 }  b2  a  sequence  of  martingale  differences,  let 

t<t'  be  stopping  times,  and  write  8^  for  the  pre-T-sigma-algebra .  Suppose  that 
r  t  •  2 

E):  ,  XT  <  00 .  Then 

^j  =  l  j 


t' 


E  (  l 

j=T  +  l 


8  }  =  0 
T 
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E«I  X  )2||  8  }  =  E{  l  X^||  8  }  . 

j=T  +  l  •  i  =  T  + 1  J 


Proposition  2.4.  Let  {X^,8^}  be  a  martingale  difference  sequence  and  let  t<i 
be  stopping  times.  Then 

(i)  for  any  integer  n  and  real  numbers  p>l ,  e>0, 

k  ,  t ' An 

P(  max  |  j  X  |>e)  <  ~  E|  l  X  |P  , 

T<k<r'An  i=T+l  J  j=T+l  J 

( i i )  for  p>l  and  C  a  constant  which  only  depends  on  p, 


l  X  |P  <  CE(  l  Xp 

i=x+l  -  i=T+l 


2,  P/2 


( i  i  i)  P(  max 
r<k< 


x  |  l  X  1 20  s  E(  l  xp 
<t'  i=T+l  •  eF  j=t+l  J 


2->  p/2 


Proof:  (il  is  the  extension  of  Kolmogorov's  inequality  applied  to  the  martingale 

x . and  (ii}  is  one  of  Burkholder's  square  function  inequalities. 
Further,  combining  (i)  and  (ii)  we  have  that 


P(  max 
T<k>T ' An 


1  xN)<^f.(J 

i=T  +  l  •  f  j  =T  + ! 


_  T '  An  . 

^  XppA 


and  (iii)  follows  by  letting  n-x*. 


As  will  be  seen,  it  is  convenient  to  have  an  easy  means  of  comparing  i  he  size 
of  a  sum  of  positive  variables  with  the  sum  of  their  conditional  expectation.  In 
the  present  context,  special  cases  of  the  following  result  hes  been  used  by 
several  authors,  but  the  result  itself--and  its  easy  proof--is  believed  to  be  new. 

Lemma  2.5.  Suppose  (Z^  . ,  8^  .}  is  an  adapted  array  of  positive  random  variables 
and  that  Tn  is  a  stopping  time  with  respect  to  j,  8^  • • • >  for  each  ti.  Then 

T  T 

rn  p  rn  p 

m  V"  r  iz  )  £  o  •»  l 

i  =  l  J  j  =  l 


-  •/' 
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and 

f  oo 

(ii)  if  {  max  Z  . }  is  uniformly  integrable  then 

l<j<T  n,J  n 
n 


11  n  11  n 

J  z  .  S  a  =>  y  e ■  , (z  .!  1  n 

ji'i  n’1  i,i  "■> 


Proof :  (i)  For  any  stopping  time  v  ,  letting  N  tend  to  infinity  in  the  identity 


v  aN 

n 


F  y  Z  .  =  y  El (i<v  }Z 


i  =  l 


n,.i 


i  =  l 


n  n, 


=  l  E { 1  { j <\>  }E(Z  .  ||  B  .  ,)} 

J  r\  ■n  t  1 1  -n  t  _  I 


i  =  l 


v  aN 
n 


=  E 


,i  =  l 


n  n ,  i  1  n ,  j  - 1 


E(Z  .  ||  8  .  1  . 

n».i  n,]-l 


shows  that 


-'n 

t:  Y  7 


^n 


Let 


y  z  .  =  e  y  e  .  (z  .1  . 

j=i  n’’  j=i  -1-1  n*J 


N 

J'  =  inf {k > 1 ;  7  E.  . (Z  .)>!} 

n  .1-1  n  »•  J 


v  =  (v'-I)at  , 
n  n  n 


and  note  that  P-1 ,  and  hence  v^,  is  a  stopping  time,  and  clearly  P(v 
r  ”^n  P 

since  ,  E.  ,(Z  .)  -*■  0.  Further,  since  v  <t  , 

P  =  1  j-P  n,p  n  n 

v 

rn  p 

Ej-l(Zn,j^  "°  ’ 

u 

.  ,  E.  ,(Z  )^1,  since  v  £  v'-l,  so  the  sum  is  uniformly  integrable 

1=1  .1-1  n,j  n  n  -  b 


and  hence 
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[i  u 

E  y  3  •  =  E  l  E.  ,(Z  .)  -  0  , 

i  =  l  "-J  j  =  i  -I'1 


as  n 


Thus,  for  any  r>0, 

T  V 

n  n 

P{  I  Z  .>c)  <  P(T  =<v  )  +  P(  y  Z  .>c)  ->  n 
j  =  l  n>J  n  n  A  n ,  j 

using  Chebvcheff's  inequality  for  the  last  term. 

(ii)  Define  in  this  case 

k 

o’  =  inf{k>l;  T  Z  .>1}  , 

n  ".J 


as  n  1 


v  =  v 1  AT 
n  n  n 


V 

so  that  v  again  is  a  stopping  time,  and  note  that  0  s  I.11,  Z  .  <  1  +  max{Z 

n  v  n,j  n,  j 

1  Bv  the  assumption  this  shows  that  Z^  ^  is  uniformly  integrable, 

and  the  proof  can  then  be  completed  in  the  same  way  as  part  (i).  H 

The  proof  of  the  following  frequently  used  result  is  left  to  the  reader. 

Lemma  2.6.  Let  {X  }  be  real  random  variables.  Then 
-  n 

p 

(i)  X  -»  0  if  and  only  if  there  exists  constants  e  ■+  0,  such  that  PCX  >r  )-»0, 

n  n  n  n 

as  n  >  °°,  and 

C C  p 

(iil  {X  }  ,  is  tight  if  and  only  if  X  /a  -+  0  for  any  sequence  (a  }  of  constants 

n  n=l  n  n  n 

such  that  a  +  <"  as  n  -*  os. 
n 

Combining  Lemma  2.6(ii)  and  Lemma  2.5,  and  noting  that  (max{Z  l<i<i  1}  , 

n,j  n  n=l 

is  uniformly  integrable  implies  that  {max  {Z  ./a  ;  l<j<T  }}  ,  is  uniformly  inte- 

‘  n.i  n  J  n  n=l 

g ruble  if  a  -►  »,  leads  to  the  next  lemma, 
n 

Lemma  2.7.  Let  (Z  . ,8  .}  and  {x  }  be  as  in  Lemma  2.5.  Then 

- — — —  n ,  j  n ,  j  n 

m  if  {^j  =  l  Ej-l(Zn,j)]n=l  is  tight  then  {^j  =  l  Zn,.i}n=l  is  tight’  and 

(ii)  if  (max{Z  l<i<T  }}  .  is  uniformly  integrable,  and  {Y . n ,  Z  . ,  is 

v  n,j  7  n  n=l  7  s  ^  =  1  n ,j  v-1 
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t 

r  r  Tl  ■,  oo 

tight,  then  {) .  E ,  Z  .}  .is  tight. 

Lj  =  l  l-l  n,.i  n=l  6 


3.  Functional  Central  Limit  Theorems  for  Martingales 


Perhaps  the  most  intuitively  appealing  explanation  of  the  martingale  central 
limit  theorem  is  the  Levv-Doob-Dubins-Schwarz  characterization  of  the  Brownian 
mot  ion --a  martingale  which  has  continuous  sample  paths  and  squared  variation 
equal  to  the  identity  is  necessarily  a  Brownian  motion  (see  [6]).  (However, 
this  of  course  goes  both  ways:  The  martingale  central  limit  theorem  on  the 
other  hand  throws  light  on  the  characterization,  and  it  can  be  used  to  provide 
a  simple  proof  of  it) .  We  start  by  proving  an  approximation  version  of  the 
characterization,  informally  that  if  the  jumps  are  uniformly  small  and  the 
squared  variation  is  uniformly  close  to  the  identity,  then  a  martingale  is  ap¬ 
proximately  a  Brownian  motion.  In  the  proof  of  finite  dimensional  convergence, 
we  use  ideas  borrowed  from  Kunita  and  Watanabe's  proof  of  the  characterization  of 
Brownian  motion  but  could  as  well  have  used  the  customary  proof  of  finite  dimen¬ 
sional  convergence,  as  e.g.  in  [8],  which  simplifies  considerably  in  the  present 
situation.  However,  the  present  proof  seems  to  tie  in  better  with  our  point  of 
view.  Once  this  result  has  been  proved,  the  most  general  central  limit  theorems 
for  martingales  follow  simply  by  random  change  of  time  and  truncation. 

Lemma  3.1.  Suppose  {X^  j>^n  y  j-1 >  n>l}  is  a  m.d.a.  and  { ( t) ;  t>0)  adapted 

timescales,  with  t  (0)=0  such  that  there  exist  constants  f.  -*-0  satisfying 
n  n  ’ 

Tn(t)  „ 


(3.1)  | X  . |  <  e  ,  I  y  X2  .  -t|  <  e  , 

1  n,  j  1  n  1  n,  j  1  n 

for  all  j,n>l  and  t50.  Then 


Sot  “  B  ,  as  n  -*■  in  D[0,°°) 
n 


»  “  -vr  r 
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I’r'of:  llsint'.  the  notation  of  (1.1)  and  of  Proposition  2.3  we  will  verifv  the 

hypothesis  of  Proposition  2.1. 

(i)  Tightness:  By  applying  in  turn  Proposition  2.4(iii)  for  p  =  4  and  (3.1) 

we  obtain  that  ,  . 

k  k  W 

y  P(vSox  (£)>e)  <  l  4E(  1  *1  ,)2 


1=1 


£ale  i=Wi)+i 


Since 


"  7  Ji11*"1*-1*26!.5  ■ 


k  2  k  2 

limsup  l  (t.-t.  . +2e  )  =  l  (t.-t-  ) 

n+~  £=1  1  n  1=  1 


<  AT  , 


this  proves  tightness. 


(ii)  Finite  dimensional  convergence:  We  have  to  prove  that  if  Sor^  S  X  asn-t®, 

n 

where  X  is  continuous,  then  X  has  the  same  finite  dimensional  distributions  as 

B.  For  simplicity  of  notation,  we  will  assume  that  k  =n,  so  that  Sot  +  X.  The 

n  n 

general  case  is  then  obtained  simply  by  changing  n  to  k^  in  the  computations  be¬ 
low.  By  the  continuity  of  X,  SoT^(t)  +  X(t),  for  t>0,  so  that 

iuSOT  (t)  .  v,.. 

Ee  ->  Ee  =  $  (u)  ,  say. 


for  each  u  and  each  t>0  and  we  only  have  to  show  that  <f>t(u)  is  the  characteris¬ 
tic  function  of  B(t)  and  the  corresponding  fact  for  the  k-dimensional  charac¬ 
teristic  functions. 

Again  with  the  notation  of  (1.1), 
iuSox  (T) 


k  iuSox  (t  )  iuASOT  (£) 
e  n  (e  n  -  1) 

£=1 


(3.2)  e  n'  '  -  l  =  y  -  n  £-1'  '-  n 


k  iuSox  (t»  .)  2 

=  l  e  "  {iuASox  (£)  -  ~  ASot  (£)  2)  +  r 

n  z  n  n 
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where,  hv  Taylor's  formula 


lrJ  5  T(  J,  I“OT„«)|3  ■ 


(»1 


Thus,  using  first  Proposition  2.4  (ii)  with  p  =  3  and  then  (3.1) 


Erl  < 


~  3 

k 

T„Ctf' 

Cu 

3 ! 

I 

E(  l 

5 

f=l 

k 

j=T„(to 1>41 

Cu 

3! 

I 

£=1 

(t£"t£-l*2S)3 

X2  ,)3/2 
n,] 


and  hence,  for  K  =  C  u  /3!, 


(1/2, 


(3.3)  limsup  E|r  |  <  KA  T  . 

n-*» 


Now,  with  the  obvious  identifications,  the  hypothesis  of  Proposition  2.3  is 
satisfied,  and  hence 


k  iuScn:  ( t n _ .a )  k  iuSox  (to  ) 

(3.4)  E  l  e  n  ASor  (f)  =  )  E{e  n  E(ASot  (i)  ||  .)) 

**  o  *  *1  n,  |  J 


i=i 


1=  1 
=  0 


and 


k  iuSox  (to  )  ?  k  iuSox  (to  .)  , 

(3.5)  E  l  e  n  4-1  ASOx  (£)  =  £  E{e  n  E (ASot  (£)  II  f  , 

1=1  n  1=1  n  "’VV-l1 


iuSox  ft.  )  Tn*-V  ? 

=  l  E(e  n  l~l  E(  l  X2  .  |t  B  .)} 

i-V't-l"1  n’Tn  l£.l 


k 

J. 


k  iuSox  (to  ) 

=  l  Ee  n  A(f)  +  R  , 


t  =  1 


where 


W 


(3.6)  |R  |  s  I  E |  l  X*  .  -  4(0 1 


1=1  i=Tn(t(-l)41 


n,J 


*  J  ft-  * 
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<  k  c  *-0  ,  as  n  +  “ 

n 

by  (3.1). 

Taking  expectations  of  both  sides  of  (3.2),  inserting  (3.3)-(3.6),  and  letting 
n  ->  <»  now  proves  that 

2  k  n 

|*T(u)  -  1  4  h.  (u)  |  <  KA17  T  . 

T  Z  1=1  tl-l 

Since  the  partition  0=tQ<t^< . . .<t^  =  T  is  arbitrary,  this  shows  that  4>t(u)  is 
Rieman  integrable  in  t  and  that 

2 

4>t(u)  -  l  =  -  \  Jj  4>t(u)dt  . 


Since  4>n(u)  =  1,  the  only  solution  to  this  equation  is 
0  2 

t 

.  ,  .  2  iuB(t) 

4>t(u)  =  e  =  Ee 

To  conclude  the  proof  it  only  remains  to  prove  the  corresponding  result  for  the 
multidimensional  characteristic  functions.  However,  if  uSot^CT)  is  replaced  by 
u(t)dSOTn(t) ,  where  the  function  u  is  assumed  to  be  piecewise  constant,  with 
only  finitely  many  jumps,  then  the  same  calculations  show  that 

E  exp (i /p  u(t) dX(t) )  =  exp(-l/2/^  u(t)^dt) 

=  E  exp(i /q  u(t)dB(t))  .  i~1 

The  first  step  in  weakening  the  hypothesis  of  Lemma  3.1  is  concerned  with  the 
second  part  of  (3.1). 


Lemma  3.2.  Suppose  {X  .,8  .}  is  a  m.d.a.  and  {t  (t) ;  tef0,ll}  are  timescales 

-  11  n,j  n,j  n  1  J 

such  that 


(3.7) 


|x  .  I  <  e  ->-0  ,  as  n  +®,  and 

1  n, j 1  n 


n  2  p 

I  X  .  +  t ( t)  ,  as  n  +  * 
j  =  l  n,J 
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for  all  n,j>l  and  te[0,l],  where  x  is  non-random  and  continuous,  then 
(3.8)  Sox^  ^  B  ,  as  n  ■>  “  ,  in  D[0,°°). 

Proof :  Define,  for  t>0 


Vt3 


t 


l 

j=l 


n'V)  =  inf{s>0;  rin(s)>t}  , 

(where  we,  without  loss  of  generality,  assume  that  p^(t)  a-+s'  <*>  ,  as  t  ->  «•)  so 

that  (n  }  and  {p  *}  are  timescales  and  p  ^  in  addition  is  adapted.  Clearly 
n  n  n  1 


nn1(t) 

t<  l  x2.,t  +  e2, 

j  =  l 


and 


hence  S  op  1  ^  B  in  D[0,°°)  ,  by  Lemma  3.1.  Since  furthermore  {p  ot  }  are 


n  n  n  n 

timescales,  and 

T  (t) 

n  2  p 

p  ox  (t)  =  Y  X  .  t  ,  as  n  -*•  00  , 
n  n  j=i  n>J 

for  te[0,l],  by  assumption, Proposition  2.2  (i)  implies  that 

S  op  *op  ox  -+  Box  ,  in  D  [0,1] . 
n  n  n  n  L  1 

It  is  easily  seen  that  moreover 

|SOTn(t)  ‘  ®nOT1n*°r1nQTii^t^  ^  £  '  Vk) -Sn°\1°Vk>  ' 

<  e  00  ,  as  n  +  “  , 

n 

and,  by  Proposition  2.2(ii),  this  proves  (3.8).  fl 

In  a  sorse.  Lemma  3.2  says  all  there  is  to  say  about  the  central  limit  theorem 


for  martingales,  since  if  Soxn  converges  to  Brownian  motion,  then  the  maximum  of  the 


il 


Xn  . 's  has  to  tend  to  zero,  which  (more  or  less)  leads  to  the  first  part  of 
(3.7),  and  then  the  second  part,  with  r(t)  =  t  is  minimal.  However,  we  will 
derive  further  conditions,  which  may  be  easier  to  check.  The  first  result 
applies  not  only  to  m.d.a.'s,  but  to  arrays  which  are  asymptotically  close  to 
m.d.a.'s  in  an  appropriate  way. 

Theorem  3.3.  Suppose  that  (X^  j>^n  j»  j-l»  n>l)  is  an  adapted  array  and  {xn(t); 
te [0,1]}  are  adapted  timescales  such  that,  for  some  a>0, 

T  (t) 

H  r\ 


(3.9) 


l  \E.  . (X  . H | X  .  | <a})  |  -  0 
J-1  n.J  n, j 


Tn(t) 


l  X*  .  £  f(t)  •  as  n  -*•  °°  , 
j=l  n,J 

for  te  [0, 1] ,  where  t  is  non-random  and  continuous.  Then 

(3.10)  Sot^  +  Box  ,  as  n  ->  00  ,  in  D[0,1]. 

Further,  if  (3.9)  holds  for  one  a>0,  then  it  holds  for  all  a>0. 

Proof:  It  is  easily  seen  that  the  second  part  of  (3.9)  implies  that 

max{|x  . |;  l<j<x  (1)}  5  0,  and  thus,  by  Lemma  2.6(i),  there  exist  constants 
n ,  j  n 

€  -*■  0  such  that 

n 

(3.11)  P(  max  |x  .  |  >  e  )  -*•  0  ,  as  n  +  “  , 


I<j<xn(l) 


h,j'  n' 


Hence 


Jj  lXn,3 |l{aa|Xn,j*>En>  *  0  •  as  "  *  "  ' 

since  all  the  summands  are  zero  on  the  set  {  max  |x  . |se  },  and  then,  according 

isj<xn(l)  n*J 

to  Lemma  2.5(ii), 


V” 


‘•12>  EMfl*„fjln«lxn>jl>s»  *  0  ■ 


as  n  -►  00 
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Thus,  using  the  first  part  of  (3.9), 


Tn(1> 


V1) 


(3 


.13)  £  I^Vj^VjK^  s  Jml  iEj-l(Xn,.il{lXn,i^a})l 


Tn<” 


+  y  E .  , C I X  . 1 1 {a> | X  . I >e 
j'=1  J-l  1  n,j  1  '  n.j  1  n 


}) 


->0  ,  as  n  -+  «>  . 


Now,  put 


X'  . 
n,  j 


X  .1  {lx  . \<t  }  ,  X"  .  =  X  .  -  X'  .  , 

n.j  n, j  n  n, j  n, j  n, j 


T  (t) 
n 


Y  .  =  X'  .  -  E.  .(X'  .)  ,  S'OT  (t) 

n.j  n, j  j-lv  n.j-’  nv 


l  Y  . 

j=l  "’-1 


Then  by  (3.11)  and  (3.13) 


Tn(1) 


t  (1) 
n  ’ 


sup  | Sot  (t)  -  S'OT  (t) I  <  l  |x»  |  +  y  | E .  (X' 

0<t<l  j=l  j  =  l  1 


->  0  ,  as  n  +  K  , 

and  thus,  according  to  Proposition  2.2(ii),  the  conclusion  (3.10)  will  follow  if 


we  prove  that  S'OT  satisfies  the  conditions  of  Lemma  3.2.  Clearly  {Y  ,,B  .}  is 

n  n ,  j  n ,  j 


a  m.d.a.,  and  |Y  . I  <  2e  ,  so  it  only  remains  to  be  shown  that 
n.j'  n 


T  (t) 

nr  2  P 

(3.14)  y  Y  .  +  t ( t)  ,  as  n  +  ® 

j=l  n,;) 


for  all  te [0, ll .  Here 
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As  an  easy  corollary,  we  will  now  obtain  conditions  which  insure  that  norm  ini', 
with  sums  of  squares  and  with  conditional  variances  is  asymptotically  equivalent. 

Corollary  3.4.  Let  {X  .,8  .}  be  an  adapted  array,  and  (x  (t) ;  te[0,l]}  adapted 


n,j 


timescales , 


(i)  If 


Tn(1> 


(3.15)  y  E.  , (X  . 1 {| X  ,|> 
n'3  n  < 1 


e})  a-  0  ,  as  n  00  ,  Vc>0  , 


t  (1) 


Tn(-1-)„2 


and  if  either  {V.n  E  (X  .)}  ,  or  {T.n  X  .}  is  tight  (which  in  par- 

Lj=l  j-1  n, j  n=l  Lj=l  n,j  n=l 


ticular  holds  if  either  sum  converges  in  probability)  then 


T  (t) 
n 


Tn^ 


sup  | 
0<t<l  j 


l  x;  -  I  e  j(x‘  .)|  z  o  , 

j=i  n’J  j=i  3-1  n’J 


as  n  -v  00  . 


(ii)  The  hypothesis  (3.15)  is  equivalent  to  the  assumption  that  (3.15)  holds  for 


P 

one  fixed  c>0,  and  that  max(|X  . |;  l<j<x  (1)}  -*  0. 

ft  $  )  ^ 


T  ( 1 ) 


~2  .  P 


Proof:  (i)  Write  X  .  =X  .l(|x  ,|>l).  Then  by  (3.15),  Y"  E.  . (X2  . 

-  n,j  n, j  n, j  7  L)-\  j-lv  n,i 


)  +  0. 


vTn^^~2  P 

and  it  follows  from  Lemma  2.5(i)  that  Xn  .  -*■  0,  so  we  may  in  the  proof 


assume  that  |X  .1  <  1,  for  n,j>l 
1  n,j 


rrTn*"1'*  2  ,«>  rvTn^1'*  2 

Then  both  {)  .  ,  E.  ,(X  .))  ,  and  {)  .  .  X  .}  ,  are  tight,  by  the  assump- 

^j=l  j-1  n,j  n=l  Lj=l  n,j  n=l  1  1 

tion  combined  with  Lemma  2.7.  Further,  (3.15)  implies  that 


(3.16)  M  =  max  E.  ,  (X2  .)  ■+  0  ,  as  n  +  »>  , 

11  l<j<T  (1)  3'1  "’J 


SjST  (1) 


and  by  Lemma  2.5(i),  that 


V1J 


y  *2  -^\k  j  >e}  >0  ,  as  n  ->  00  ,  Ve>0  , 

j  _  j  ti ,  j  n ,  j 


which  in  the  same  way  gives  that 
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(3.17)  M 

n 


2  P 

max  X  .  •+  0 

b'i<i  (1)  n,J 
n 


as  n  -*•  00  . 


C 1  er  r  1  v ,  { Y 

n,.i 


■>  a 

X“  .  -  E.  ,X“  .}  is  a  m.d.a.  with 
ii ,  .1  J  - 1  n ,  j 


Y 


n,  J 


<  2,  and  since 


Tn(t) 


l  Y 


Tnm 


.1=1 


n,J 


2{  l 

j=l 


X4  .  + 
n,J 


tn(l) 


j=l 


E.  (X2  ,)2} 
J-l  n,J 


<  2M 


n 


Tn(1) 

l  ■ 

j  =  l 


V1^ 


n>  J 


+  2M 


j  =  l 


E.  ,(X*  .) 
J-l  ".J 


P  n 

-*■  0  ,  as  n  -*■  <=°  , 

hv  tightness  and  (3.16),  (3.17),  and  the  Corollary  then  follows  from  the  theorem 
(with  x(t)  7  0) . 

p 

(ii)  Prom  (3.17)  follows  that  (3.15)  implies  that  max{|X  ,|;  l<i<r  (1))  -*•  0. 

ti  ,  J  T1 

The  other  implication  follows  in  the  same  way  as  the  last  assertion  of  the  theo¬ 
rem.  n 


We  can  now  prove  the  general  functional  central  limit  theorem  for  martingales. 
Of  course,  the  most  important  special  case  of  it  is  when  x(t)  it,  and  the  limiting 
process  is  an  ordinary  Brownian  motion. 

Theorem  3.5.  Suppose  {X^  ^,8^  ^ }  is  a  m.d.a.,  {x^ft) ;  tefO.l]}  are  adapted  time- 
scales  and  r(t);  te[0,l]  is  a  continuous,  non-random  function  and  suppose  that 
one  of  the  following  three  sets  of  conditions  holds: 

E  max  |x  .  |  -+•  0  ,  as  n  ■+  ”  , 

l<j<x  (1)  n,J 
(3.18)  n 

T(t) 

n  <-y  p 

y  X".  ->  t  ( t)  ,  asn-M"  ,  for  tefO.l]  , 

i  =  l  11 


or 
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(3.19) 


T„ll> 


l  E -  _ (X  . 1  { | X  . |> 
J-l  n,.1  1  n,j  ' 


e)  £  0  ,  as  n  ->  «  ,  Vfr>0 


Tn(t) 


I  E.  j (X“  .)  -*  i(t)  ,  as  n  -*■  ®  ,  for  te[0,l] 

j  =  l 


or 


(3.20)  (3.19)  holds  for  one  e>0,  and  max  jx  .  |  ->■  0  ,  as  n 

l<i<x  (1)  n’3 

n 


Then 


Sot  °  Box  as  n  -*■  00  ,  in  D  f 0 , 1  ] 


Proof:  Assume  first  (3.18)  is  satisfied.  Then  (  max  lx  .1)  ,  i ; 

-  .  . 1  n. ) '  n=] 

integrable  and 


l<j<Tn(l) 


Tn(n 

l  |x  • |1{|X  ,  I >1 )  +  0  ,  as  n  - 

j  11  >  J  11  >  J 


Since  (X  .}  is  a  m.d.a. ,  it  follows,  using  Lemma  2.5(iil  that 

n,  l 


Tn(I) 


Tn(1) 


1}) 


Tn(i) 


<  l  E.  ( |x  .  1 1  {( X  . !>ll) 

j-l'  n,  i  n,j  1 


■>  0  ,  as  n  -*■  00  , 


and  thus  the  conditions  of  Theorem  3.3  are  satisfied,  and  Sot^  -*  Bot 
Next,  assume  (3.19)  holds.  By  Corollary  3.4,  it  follows  that 


DO 


uni formlv 


as  required. 
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T  ( t ) 

nr  2  P 

)  X  .  -*  T(t)  as  n  -*■  «»  ,  for  tc  [0,1]  , 

1  =  1  n  ’  1 


and,  attain  using  that  (X^  .}  is  a  m.d.a.. 


vn 


V1) 


|Ui-l(XnJl{!Xnjl^l  5  lsl  'E, 


V” 


<  l  E.  (X  ,1{|X  . |>l}) 

jij  J-l  n,.i  n,  i  1 


hv  assumption,  so  again  the  conditions  of  Theorem  3.3  are  satisfied,  and 


<•  d  „ 

Sot  v  Bot. 
n 


finally,  by  Corollary  3.4(ii),  the  conditions  (3. 191  and  (3.20)  are  equivalent, 
so  the  result  holds  also  under  (3.20).  n 

Corollary  3.6.  Suppose  (X_  ,  ,8_  ,)  is  a  m.d.a.  and  for  each  n.  x  is  a  stonnine 

n ,  .1  n ,  j  n  1 

time  with  respect  to  8^  J(  8^  2"  '  '  and  suPPose  one  °f  the  following  three  sets 
of  conditions  holds: 


(3.21) 


F.  max  X  .  |  -*■  0  , 
l<j<T„  n’J 


v  2  P 

l  K  i  . 


as  n  -►  °° 


as  n  4  00  , 


l  E  .  (X2  . 1 { | X  .  |>e})  £  0  , 

i  =  l  1-1  n>.i  n.i  1 


as  n  -v  00  ,  Ve>0  , 


(3.22) 


Z  E 1 


as  n  ■*  ®  , 
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(3.23) 


(3.22)  holds  for  one  t;>0,  and 


lx  .  I  K  o  . 


!<)<!, 


(3.21) 


I  Xn  ;  -  N(0,1)  , 


as  n  -*  °° 


Proof:  To  prove  the  result  assuming  (3.21),  define  adapted  timescales 

^  CO  _  | 

v  ( t)  ;  tc  f  0, 1 1  /  ,  (similar  to  p  (t)  in  the  proof  of  Lemma  3.2)  bv 

n  n  =  l  n 


(t)  =  inf{k;  f  X“  . > t } at  ,  for  0<t<l  , 


n,  j  n 


t  (1)  =  t  . 
n  n 


It  is  casilv  seen  that  {r  (t)}  satisfies  the  condition  (3.18)  of  the  theorem  (cf. 

■  n 

the  proof  of  Lemma  3.2),  so  in  particular  Sot^fl)  ^  B(l),  which  is  iust  another 
way  of  writing  (3.24). 

The  proof  unde"  (3.22),  or  the  equivalent  condition  (3.23)  is  similar;  one 
2  1 

iust  has  to  replace  X  .  bv  E.  .(X"  .)  in  the  definition  of  i  (t) . 

n,j  J-l  n,j  n 

We  conclude  this  section  with  several  comments  on  the  results. 

(i)  In  reasonable  circumstances  the  conditions  are  also  necessary  for  the  func¬ 
tional  martingale  central  limit  theorem.  In  fact,  if  {maxfjx^  ,j;  l<j-7n(l)H 
is  uniformly  integrable,  and  if  "Tn  takes  all  relevant  values"  (see  | 19)  for  a 
def init ion--this  holds  in  particular  if  x  (t)  only  has  jumps  of  size  one,  as 
e.g.  when  x  (t)  =  [nt)),  then  Soxn  ^  Box  implies  (3.18),  see  [19].  Easy  examples 
show  that  neither  uniform  integrability  nor  "Xn  takes  all  relevant  values,"  can  he 
entirely  dispensed  with  in  this  statement. 


j  -S  ■ 


» 
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T  (11 
v  n  2 

Similarly,  it  limsup  I:  X  .  <  u  ,  then  Sot  implies  (3.10)  and  the  e<iui- 

n  **'  1  n  *  -1  11 

valent  condition  (3.20),  see  [7],  [19].  Furthermore,  {max(|x  .1;  l<i<T  ( 1 )  }  V 

n , .)  n  n  =  1 

then  is  uniformly  intcgrahle,  and  (3.18)  follows  from  (3.19)  and  (3.20)  by  Coro] - 
1  ary  8.1. 

(ill  One  important  special  case  of  the  theorem  is  the  degenerate  one.  when 

:(t)  "  0.  From  the  theorem,  if  {max{|X  I;  lsi<T  }  ,  is  uniformly  integrahle. 

n, inn-1  * 

_  n  t  p  ^ 

and  )  X  >  0,  for  some  sequence  {t  }  ,  of  stopping  times,  then 

.  I  n ,  i  n  n=  1  1  '  *  ’ 


t  3.J8)  sur  ’  }  X  .|*0,  as  n  -*  ■  , 

,  ,  .  ,  n ,  i 

l-=k<i  1  =  1 
n  - 

:utd  conversely,  if  (3.23)  holds,  and  {max  { |  X^  |  ;  is  uniformly  inferra¬ 

ble,  then  Y,n  X-  .  ^  0. 

L 1  n ,  .i 

Similarly,  if  ,n,  F.  ,  (X“  . )  -*  0  ,  then  (3.23)  holds,  and  conversely,  (3.23) 
i  =  1  j  - 1  n ,  j 

and  limsup  E  X“  .  <  03  implies  that  E._.(X2  •)  +  °- 

n-*x  i=ii-in,i 

(iii)  If  the  Xn  . 's  in  each  row  arc  independent,  and  the  Tn(t)'s  are  non-random, 

then  for  C  .  =  o(X  X  .)  the  second  part  of  (3.19)  ]ust  says  that  the 

n.j  n , 1  n,  j 

T  (t)  2 

that  the  normalization  is  such  that  VQ  X“  .)  -*■  x(t)  ,  and  the  first  part  is 

j=l  n'7 

I.  i  ndebi-rg '  s  condi  t  ion  . 

(iv)  There  is  another  important  special  case  in  which  the  conditions  of  the  theorem 

are  particularly  easy  to  check:  if  the  X  .'s  are  obtained  by  normalizing  a  single 

n ,  j 

stationary  ergodic  sequence  ...X  X^,  X^,..., 

X.  -  E(X.  I!  8.  .) 

x  .  =  ~2 - i!L±±-  , 

"’-1  a/n 

with  6  =  8  .  =  o(...,  X  ,  X  )  and  a2  =  F.(X.  -  E(X.||8.  . ) )  “  assumed  to  bo 

11  J-1  J  J  J  .1  - 1 


Vs-  * 
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strict  1 y  positive  and  finite,  then,  for  t  (t)  =  [nt]  ,  it  follows  at  once  from 
the  ergodic  theorem  that  (3.19)  holds  and  hence 


T  (X.  -  F.(X.  ||  B  .  ))/  /n  >  oB(t)  . 

i  =  1  •'  - 


In  nonstationary  cases,  the  conditions  of  the  theorem  often  have  to  he  checked 
by  computing  higher  moments,  e.g.  the  first  parts  of  (3.181,  (3.19)  follow  if 


E  V  X*  .  0  ,  as  n  +  11  ,  for  some  a -1  , 


and  the  second  pa~fs  of  (3.18),  (3.19)  may  be  obtained  lay  computing  means  and 
variances  of  the  sums  on  the  left  hand  sides. 

(v)  Throughout,  we  have  (implicitly)  assumed  that  T^d),  and  hence  t  (t)  for  t-1 

is  finite  a.s.  However,  a  small  further  argument,  using  (ii)  above  shows  that 

this  can  be  dispensed  with,  and  that  the  theorem  (and  the  corollary)  holds  also 

if  >n(i)  (or  x  )  are  extended  stopping  times  which  may  be  infinite  with  positive 

T  (1) 

probabilities  provided  ^  .  converges  with  a  probability  which  tends  to 

one  (this  is  automatic  under  (3.19),  (3.20)). 

(vii)  It  is  obvious  from  the  proo*'  of  Droposition  2.2(i)  that  the  second  n»i-t 

T  ( •)  2  d 

of  (3.18)  can  be  weakened  to  requiring  that  £_._j  t(*)  in  Do[0,l],  with 

t(0  non-random,  but  possibly  discontinuous,  and  similar  remarks  apply  to  (3.19) 
and  (3.20). 


4.  Mixing  and  ^ltidimensional  Processes 

In  this  section,  the  convergence  in  the  previous  results  will  be  strengthened 
to  Renyi -mixing--which  in  turn  will  make  it  possible  to  remove  the  condition  that 
T ( t)  is  non-random--nnd  multivariate  versions  of  the  results  will  be  obtained. 
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Wh ili'  the  purpose  of  the  previous  sections  is  to  provide  complete  proofs, 
stort inn  from  scratch,  of  the  basic  results  of  martingale  central  limit  theory, 
the  intention  of  the  present  section  is  only  to  indicate  one  possible  direction 
for  developing  the  results  further- -examples  of  other  directions  being  provided 
by  limit  theorv  for  continuous  parameter  martingales  and  diffusion  approximations-- 
and  we  will  accordingly  give  a  more  sketchy  development,  sometimes  leaving  details 
of  arguments  to  the  reader,  and  referring  to  results  from  other  areas  as  they  are 
needed,  rather  than  explicitly  collecting  them  at  the  beginning. 

Some  further  notions  are  needed  for  the  results.  As  in  Section  2,  let  X, 

!\  -  he  random  variables  in  a  complete  separable  metric  space  (S,p),  and  in 
addition  assume  that  all  the  X  's  are  defined  on  the  same  probability  space 
(  ,8,1’).  Then  iX  '  is  Penyi-nixing  (or  just  nixing)  with  limit  X,  X  +  X  (mixing) 
if  X  •  X  in  IS,;"),  with  respect  to  the  conditional  probability  P(*|B),  for  any 
B- 8  with  P ( B ) > 0 .  Further,  {X^i  is  Renyi-stable  (or  just  stable)  if  Xn  converges 
in  distribution  to  some  limit,  with  respect  to  P ( - [ B) ,  for  any  BeB  with  P(B)>0. 

Thus  a  mixing  secpience  is  stable,  and  conversely  if  a  stable  sequence  has  the 
same  limit  with  respect  to  P(*|B),  for  all  B,  then  it  is  mixing.  Loosely 
speaking,  a  sequence  is  mixing  if  it  converges  in  distribution  and  is  "asymp¬ 
totically  independent"  of  any  fixed  events.  Prominent  examples  of  sequences 
which  are  stable  but  not  mixing  is  given  by  sequences  which  converge  in  proba¬ 
bility,  or  almost  surely,  to  a  non-degenerate  limit.  Some  indication  of  the  use 
and  interest  of  mixing  is  given  by  the  fol lowing  result . 

Proposition  4.1.  The  following  three  assertions  are  equivalent: 

(i)  Xn  -+  X  (mixing)  , 

(ii)  X  +  X  ,  as  n  +  ®  ,  with  respect  to  P ( • | B)  for  all  Be8.  with  P(B)>0 

n  U 

for  some  algebra  BQ  which  generates  o(Xj,  X^,...)*  and 
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.  .  uu 

(iii)  if  Y,  {Y  }  ^  are  random  variables  in  another  complete  separable  metric- 

space,  with  Y  +  Y,  as  n  +  then  (X  ,Y  }  +  (X,Y) . 
n  n  n 

Sketch  of  proof.  Clearly  (i)  is  equivalent  to 

(4.1)  Eh(X  )1D  ->  Eh(X) P(B)  ,  as  n  +  “  , 

n  d 

for  any  bounded  continuous  function  h:  S  -*■  H  and  event  Be8,  and  similarly  for 
(ii),  with  8  replaced  by  8^.  Clearly  (i)  implies  (ii).  Further,  to  any  c>0  and 
BcB  there  exists  a  B££8q  with  P(BAB£)  <  z,  cf.  [5],  p.  606,  and  hence  if  |h|  <  C, 
then 

|Eh(X  )1D  -  Eh(X  )1  I  s  Ce  . 

1  v  n  B  n  e1 

D 

This  is  easily  seen  to  imply  (4.1),  if  Eh(X  )  1  Eh(X)P(BC),  for  all  f>0,  and 

n  B 

hence  ( i i)  implies  that  (4.1)  holds  for  B  £  o(X^ ,X? , . . .) .  The  case  of  general 
B eL  then  follows  by  a  further  small  argument,  as  in  [1], 

It  is  straightforward  to  see  that  (iii)  holds  if  and  only  if  (Xn<Y)  ^  (X.Y) 

‘or  any  fixed  random  variable  Y  in  (S',p'),  and  this  in  turn  is  equivalent  to 

(4.2)  Eh(Xn)l{YeB)  £  Eh(X)P(B)  ,  as  n  -*  »  , 

for  any  continuous  bounded  h;  S  ■*  F  and  any  Y-continuity  set  B,  cf.  [3],  p.  2('. 
Obviously  (i)  implies  (4.2),  and  conversely,  for  an  arbitrary  event  BeB,  taking 
Y  =  1^  and  B  =  (1/2, 3/2),  say,  (4.1)  follows  from  (4.2).  Hence  (i)  and  (iii)  are 
equivalent.  1 ; 

Additional  interesting  properties  of  Renyi-mixing  is  that  it  is  preserved 
under  absolutely  continuous  change  of  measure,  and  that  it  implies  that  the  sample 
paths  fluctuate  strongly,  see  [1,17,19].  Furthermore  it  should  be  noted  that, 
with  obvious  changes  only.  Lemma  4.1  holds  also  if  mixing  convergence  is  replaced 
by  stability.  Using  Proposition  4.1(ii)  it  is  easy  to  see  that  the  limits  of  the 
previous  section  are  mixing. 
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Theorem  4.2.  If  the  o-algebras  {8  .}  are  nested  ,  i .e.  if  8  ■  c  B  ,  .  .  for 

all  n,  j,  then  the  conclusions  of  Theorems  3.3  and  3.5  can  he  strengthened  to 
mixing  convergence,  i.e.,  under  the  same  hypotheses,  Sot^  +  Box  (mixing),  and 
similarly  for  Corollary  3.6. 

Proof :  the  proofs  under  the  different  hypotheses  are  all  similar,  so  we  will 
only  give  one  as  an  example,  say  the  first  part  of  Theorem  3.5. 

Thus,  we  will  assume  (3.18)  holds.  According  *0  Proposition  4.1  it  is  suf¬ 
ficient  to  show  that  if  B  is  a  fixed  event  in  the  algebra  u  ,  .  ,  8  .  with 

6  n>l,j>l  n,: 

P(R)>(),  then 

(4.3)  Sox  ^  Box  .  as  n  ->  00  ,  in  D[0,1]  , 

with  respect  to  P(*]b).  Since  B  e  u  8  .  ,  there  are  n  , j  with 

ri— ljj  —  i  n ,  j  u  u 

Be8  .  ,  and  since  the  8  .'s  are  nested  and  increasing,  it  follows  that 

n0,J0  11,3 


B  e  8 


n,J 


for  n  >  n 


0 


T  (t) 


Let  X'  .  =  X  .  1  {j>in}  and  write  S  'ot  (t)  =  T.™  X'  . 
n, j  n,j  • n  n  '  j=l  n, j 


j  *  J 


o  • 


Then 


sup  ! SOt  ( t)  -  S'OT  (t)  |  <  l  | X  ,|  <  j  max  |x  | 
0<t<l  n  j=l  n’3  u  l<j<Tn(l)  n’3 


-  0  , 

in  P-prcbabil ity,  and  thus,  as  is  immediately  seen,  in  P( • |B) -probability.  Hence 
by  Proposition  2 . 2 ( i i ) ,  (4.3)  follows  if 


(4.4)  S'ox^  +  Box  »  as  n  >  03  ,  in  D[0,1]  , 

under  P(*|b).  Clearly,  (X^  j,  . ;  nsn^,  j>l)  is  a  P( • |B) -m.d.a. 


E(  max  |X’  |  |B)  <  E  max  |x 

1<  j^T  (1)  ’3  PtB)  l<j<T  (1)  11,3 


0  ,  as  n  +  ®  , 


•  liAnjjT-' 
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Tntf)  T  (t)  j  AX  (t) 

n  n  ~  On 

l  (X*  ,)2  =  I  x2  .  -  y 
n,i  n,j  .L. 

.1  =  1  j=l  j  =  l 


-*■  T(t)  -  0  =  t ( t)  , 

in  P--and  hence  in  P( • | B) -probability,  and  thus  (4.4)  follows  from  the  first  part 

of  Theorem  (3.5),  applied  to  (X1  .,8  .}.  '' 

11  n, j  n,  j 

For  the  extension  to  several  dimensions  it  will  he  useful  to  have  a  slightly 
different  description  of  the  limit  process  Box,  and  to  emphasize  this  we  will 
change  notation  and  will  in  the  sequel  write  instead  of  Box.  Clearly,  B^  can 
be  characterized  as  the  normal  process  which  has  mean  zero,  variance  function 
t( t)  and  independent  increments.  Further,  of  course,  B  =  Box  makes  sense  also 
if  t  is  a  stochastic  process,  and,  in  particular,  if  x  is  independent  of  B,  which 
we  will  assume  throughout,  then  B^  can  be  described  by  saying  that  conditional  on 
X,  B^  is  normal,  with  zero  mean,  variance  function  x(t),  and  with  independent  in¬ 
crements.  Similarly,  given  a  nonnegative  definite,  nondecreasing  matrix  valued, 

possibly  random,  function  x(t)  =  (x.  . (t)  ;  l<j,k<d),  we  define  a  d-dimensional 

J  > A 

process  =  (B^  , . .  .  ,B^)  by  requiring  that  conditional  on  x(t)  it  is  a  normal 
process  with  mean  zero,  variance  matrix  V(BT(t)||  x)  =  x(t),  and  with  independent 
increments.  We  then  have  the  following  extension  of  Theorem  3.5.  (Theorem  3.3 
has  the  analogous  extension--this  is,  however,  left  to  the  reader). 

Theorem  4.3.  Suppose  (X  .,8  .}  is  a  d-dimensional  m.d.a.,  i.e.,  that  X  .  = 

-  n,j  n,j  n,.i 

(X(l5,...,xf«)T  are  d-dimensional  random  vectors,  such  that  (X^.,8  .)  is  a 

n ,  i  n ,  j  n , j  n ,  i 

m.d.a.,  for  k  =  1,2,  ...,d,  that  {x  (t) ;  te[0,l]}  are  adapted  timescales,  and 
that  T ( t)  =  (x.  , ( t) ;  l<j,k<d)  is  a  continuous,  possibly  random,  matrix  function. 

j ,  k  • 

(i)  If  one  of  the  following  three  sets  of  conditions  holds, 

E  max  |  X<k>  |  -*■  0  ,  as  n  ->  n>  ,  for  k  =  l , .  . .  ,d, 

1 - j - T  ( 1 )  n>J 


W»  * 
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(4.5) 


Tn(t) 

}.  X  .x]'  £  x(t)  ,  asn->«"  ,  for  te[0,l] 

»  J  **  »  J 


or 


TnU) 


J  E.  1((X(k).)2l{|x(k5.|>-1'P 
j'-j  J-l  n.J  n ,  j 


e})  0  ,  as  n  -v  <»,  v  e>0,  and  for  k=l , .  .  .  ,d. 


(4.6) 


Tn(t) 


y  E.  .(X  .X1  .)  4-  x(t) 

j=l  I'1  n>J  n'J 


,  as  n  ->  00  ,  for  te  [0,1]  , 


or 

(4.7)  (4.6)  holds  for  one  e>0,  and  max  |X^. )  +  0  ,  as  n-*»,  for  k=l,...,d, 

l<j<Tn(l)  n*  ^ 

and  if  in  addition  the  o-algebras  {8  .}  are  nested,  then 

J 

Sotn  d  Bt  ,  asn  +  ”  ,  in  D[0,l]d  , 

and  the  convergence  is  stable:  If  Yn  £  Y  in  (S’,p'),  then  (Sox^Y^)  d  (B^,Y) 
in  D[0,l]d  x  S',  where  the  distribution  of  (B^.Y)  is  determined  by  the  require¬ 
ment  that  B  and  Y  are  independent,  and  that  the  distribution  of  (x,Y)  is  the 

^(*3  T  (•) 

limit  of  the  distributions  of  (J.71.  X  .XT  . ,  Y  )  or  of  (T.11,  E.  ,  (X  ,XT  .)  ,Y  ), 

Lj  =  l  n,;  n,r  n'  vtj=l  j-P  n,j  n,jJ  ’  nJ  ’ 

respect ively. 

I i i )  If  in  (i)  the  limit  x(t)  is  non-random,  then  the  hypothesis  that  the  B  ,'s 

a » J 

d  k 

arc  nested  can  be  deleted,  and  it  still  follows  that  Sox  -*•  B  in  DfO.ll  .  but 

n  x  1 

the  convergence  is  not  necessarily  stable. 


Proof:  Suppose  first  that  d=l  and  that  (4.5)  holds.  Without  loss  of  generality 

we  may  as  in  Lemma  3.2  assume  that  the  m.d.a.  (X  .,8  .}  satisfies  E  maxlx  .  Uo 

n,j  1  n,j  1 

vk  2  a  s 

as  n  -*•  °°,  and  that  ,  X  .  i  ®,  as  k  ->  ®  for  each  n.  We  will  now  proceed  as 

3=* 

in  Lemma  3.2,  defining 
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■  l 


(4.8) 


j=l 


n,] 


n^!(t)  =  inf (s>0;  nn(s)>t}  . 

It  follows  at  once  from  Theorem  4.2  that  Sor^1  +  B  (mixing)  in  D[0,1],  Clearly 
Theorem  4.2  can  be  immediately  translated  to  D[0,T],  for  any  T>0,  and  it  follows 
that  Sori^1  +  B  (mixing)  in  D[0,T],  for  any  T>0,  which  in  turn  implies  that 
Sort^1  ^  B  (mixing)  in  D[0,°°),  cf.  [21]  .  By  assumption, 


n  ox 
n  n 


Tn(t) 

=  I 

j=l 


T(t) 


and  if  in  addition  Y  -*■  Y  ,  then 

n 


(Sop  ,  n  ox  ,  Y  ) 
n  n  n  n 


by  Proposition  4.1, 
+  (B.t.Y)  , 


where  B  is  independent  of  (x,Y).  By  a  minor  extension  of  Proposition  2.2fi),  it 
follows  that 

(Son_1on  or  ,  Y  )  +  (B  ,  Y)  , 
n  n  n  n  x*  ’  ’ 

and  then  since 

sup  I  SOT  ft)  -  Son'1 on  OT  (t) I  <  max | X  . | 

0<t<l  l<j  n’J 

p  n 

->0  ,  as  n  “  , 

the  desired  result  follows,  that 

(Soxn,  Yn)  ^  (Bt,  Y)  ,  as  n  +  “  , 

this  time  by  a  small  extension  of  the  second  part  of  Proposition  2.2  . 

Still  assuming  (4.5),  {Sox^}  is  tight  also  for  d>l,  since  by  what  just  has 
been  proved  its  components  converge  in  distribution,  and  hence  are  tight.  Thus, 
similarly  for  convergence  in  D [ 0 , 1 ] *^  as  in  D[0,1],  it  only  has  to  be  shown  that 
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the  finite  dimensional  distributions  converge.  Equivalently,  as  is  easily  seen 
by  considering  multidimensional  characteristic  functions,  it  is  sufficient  to 


prove  that 
(4.9) 


J  u(t)dSotn(t)  +  /  u(t)dBT(t) 


as  n  -*■  °°,  if  u(t)  is  non-random,  piecewise  constant  and  with  only  finitely  many 
jumps  (this  is  just  the  Cramer-Wold  theorem,  [3],  p.  49).  However,  clearly  the 
left  hand  side  of  (4.9)  is  a  sum  of  martingale  differences  and  the  convergence 
follows  easily--though  with  some  notational  qualms--from  what  has  been  proven 
for  the  case  d=l . 


The  proof  of  part  (.i)  under  the  hypotheses  (4.6)  or  (4.7)  differs  from  the 
above  in  only  one  place--instead  of  defining  n  ,  by  (4.8)  it  is  convenient  to 


*  j, 


Finally,  the  one-dimensional  version  of  part  (ii)  is  just  Theorem  3.5,  and 
the  multidimensional  version  is  then  obtained  in  precisely  the  same  way  as  above.  H 

Corollary  4.4.  Suppose  (X  .,8  . }  is  a  d-dimensional  m.d.a.,  {x  }  is  a  sequence 

- -  "  n,  j  n,j  n 

of  stopping  times,  and  x  =  (x.  ,  ;  l<j,k<d)  a,  possibly  random,  matrix. 

]  ,  K 

(i)  If  one  of  the  conditions  (4.5) -(4.7)  holds,  with  x^(l) ,  x  (t)  replaced  by 

x  ,  and  x ( t)  replaced  by  x,  and  if  the  a- algebras  {8  .}  are  nested,  then 
n  Ti  f  J 


11  a  ~ 

l  X  .  +  B 
j  =  l  ^  1 


as  n  -*•  00  , 


where  conditional  on  x,  B  is  normal  with  mean  zero  and  variance  matrix  x.  If 

x 

furthermore  x  is  strictly  positive  definite  a.s.  and  the  modified  version  of 


(4.5)  holds,  then 


(  j  X  ,XT  .) _1/2  J  X  .  2  N , ( 0 , 1 )  , 

n.j  n, j  n, j  d 


as  n  +  “>  , 
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and  if  the  modified  version  of  (4.6)  or  (4.7)  holds,  then 

j = i  j=i 

(ii)  If  t  is  non-random,  then  the  hypothesis  that  the  a-algebras  are  nested  can 
be  deleted. 

Proof:  The  first  part  of  (i)  follows  from  the  theorem  in  a  similar  way  as  Corol¬ 
lary  3.6  follows  from  Theorem  3.5,  after  reducing  the  problem  to  the  case  d=l  bv 

d  T  f 

considering  linear  functionals,  £  u,  X  .  (the  Cramer -Wold  theorem).  The 

k=l  k  j-i  n>3 

second  part  is  then  immediate  after  noting  that  stability  implies  that,  e.g.  under 
the  modified  version  of  (4.5),  there  is  joint  convergence, 


n  T  n  j 

(  y  x  .x1.,  y  x  .)  s  (T,b  )  , 

j,j  ".3  n,j  n,j 


and  hence 


(  I  X  ,XT  .) 'l/2  l  X  .  +  t'1/2B 
n>3  ">3  n.J  T 

=  Nd(0,I) . 

The  proof  of  part  (ii)  is  similar. 

Clearly  the  remarks  (iii)-(vii)  after  Corollary  3.6  apply, 
also  to  the  present  situation.  Further,  as  a  final  remark,  considering  e.g.  d=l 
and  Condition  4.5,  the  requirement  that  the  sums  of  squares  converge  in  probabil¬ 


ity,  and  not  only  in  distribution  is  only  used  to  insure  asymptotic  independence, 

T  ( •)  2  ! 

so  that  the  marginal  convergences  X  .  -+  t  and  Sor^  -v  B  imply  joint  con¬ 


vergence 


(4.10) 


V> 


(  I  X*  ,,  son'1)  i  (t,  B)  , 


n,  3  n 


as  n  -*•  00  , 


where  t  and  B  are  independent,  and  hence  convergence  in  probability  can  be  re- 


placed  by  any  weaker  condition  which  still  insures  (4.101.  One  set  of  such  con¬ 
ditions  is  given  in  Theorem  3.4  of  Hall  and  Heyde  [8], 

APPENDIX 

Here  we  will  prove  the  simple  special  case  p=4  of  the  Burkholder  inequality 
stated  in  Proposition  2.4(ii),  which  is  the  only  case  needed  for  this  paper.  In 
fact  the  proof  of  finite  dimensional  convergence,  p.  12,  l  4  also  uses  p=3,  but 
that  might  as  well  have  been  reduced  to  p=4  by  first  using  the  Liapunov  inequality 

ElASOx  (£)!3  <  (EASOt  (£)4}3/4  . 

1  n  1  n 

Thus  let  {Xj,6^}  be  a  martingale  difference  sequence.  We  will  start  by  as¬ 
suming  that  Doob's  inequality 
k  .  n 

(A.  1)  E  max  |  J  X.  |  <  C'E(  £  X.) 

l<k<n  j=l  }  j=l  J 

is  known,  and  prove  that  then 

(4.2)  H(  l  X.)4  <  CE(  T  X2)2  , 

3=1  3  3=1  • 

where  (' '  and  C  are  universal  constants. 

I.et  S(n)  =  X.  and  M(n)  =  max{|^_1  X.|;  l<k<n).  By  the  martingale  dif- 

3  3 

ference  property,  ES(j)  =  E{S(j)  E(X^+j||  B^)}  =  0,  so  that,  expanding  S(i+1) 

4 

(S(jl+Xj+1)  we  have 

ESO  +  114  -  ES( jl 4  =  EX4  +  1  +  4ES(jlX3+]  +  6ES(j12X2+1  , 
nd  thus,  summing  over  j, 

(A. 3)  ES(n) 4  =  l  EX4  +  4  £  ES(j-l)X?  +6  £  ES(j-l)2Xt  . 
j  =  l  -1  3  =  1  -1  3  =  1 

Inserting  the  obvious  inequalities  ]S(j-l)|  s  M(n)  and  |x^|  <  2M(n)  into  (A. 3) 
wc  obtain 

A  2  n  2 

(A. 4)  ES ( n ) 4  <  (4+8+6) E(M(n)  l  X.). 

j=l  3 
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Now,  by  the  Cauchy-Schwarz  inequality  and  by  (A.l) 


EM(n) ' 


l  XZ  <  (EM(n)4}1/2{E(  l  XZ)Z} 
j=l  J  i=l  -1 


,2,2, 1/2 


<  {C'ES(n)V/2{E(  l  X?) 2} 1/2  , 
j=l  J 

4  1/2 

and  inserting  this  into  (A. 4)  and  dividing  through  by  {ESln)  }  '  "  we  obtain  (A. 21 
2 

with  C  =  18  C’  (which  is  not  the  best  possible  value  of  C,  cf.  [4]). 

The  general  assertion  of  Proposition  2.4(ii),  for  p=4 ,  now  follows  easily  by 
replacing  X_.  by  Xj1{t<X.<t'}  in  (A. 2)  and  letting  n  tend  to  infinity,  using 
Fatou's  lemma  on  the  left  hand  side.  (In  fact  the  proof  works  not  only  for  p=4 , 


but  for  any  even  integer  p.) 
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